Skip to content

Data fixes#36

Merged
kandrosov merged 43 commits intocms-flaf:mainfrom
valeriadamante:data_fixes
Feb 4, 2026
Merged

Data fixes#36
kandrosov merged 43 commits intocms-flaf:mainfrom
valeriadamante:data_fixes

Conversation

@valeriadamante
Copy link
Contributor

This PR is intended because I realized that in Run2024 config the data processes were the 2022 ones

@kandrosov kandrosov added enhancement New feature or request and removed enhancement New feature or request labels Feb 2, 2026
@kandrosov kandrosov added enhancement New feature or request and removed enhancement New feature or request labels Feb 3, 2026
@valeriadamante
Copy link
Contributor Author

@cms-flaf-bot please test

@cms-flaf-bot
Copy link

pipeline#13959841 started

@cms-flaf-bot
Copy link

pipeline#13959841 passed

@kandrosov kandrosov requested a review from Copilot February 4, 2026 20:38
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates Run 3 configuration and analysis code to align processes, datasets, and global settings with the correct Run 2022–2024 productions, and introduces a configurable low-mass dimuon cut in the analysis baseline.

Changes:

  • Reorganizes DY, EWK, Higgs, TT/ST/TW/TTX, and data processes across Run3 2022/2022EE/2023/2023BPix/2024, including new EWK 2μ2j and detailed VBF/ggH/VH/TTH signal processes, and updates phys_models.yaml accordingly.
  • Extends dataset catalogs for these eras with the appropriate NanoAOD/MiniAOD campaign paths, adds generator and cross-section metadata, and wires new data-taking periods.
  • Adds nano_version and related globals to Run 3 global.yaml files, switches the jet-veto logic to use it, and introduces LowerMassCut in the baseline to enforce an m(μμ) > 45 GeV selection at the RDataFrame level.

Reviewed changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 5 comments.

Show a summary per file
File Description
config/processes.yaml Defines new DY stitching anchors for all- and single-flavor DY, used by Run 3 process configs.
config/phys_models.yaml Updates base model background and signal process lists to reference the new Run 3 processes.
config/Run3_2024/processes.yaml Aligns DY, W, EWK, Higgs, TT/ST/TW/TTX, signal, and data processes with 2024 datasets and new DY processors.
config/Run3_2024/global.yaml Adds nano_version: v15 and explicit nanoAODVersions for Run3_2024.
config/Run3_2024/datasets.yaml Registers 2024 DY, EWK, Higgs, TTLNu-EWK, and other samples with NanoAOD/MiniAOD paths.
config/Run3_2023BPix/processes.yaml Reworks 2023BPix DY, EWK, signal, and TTX processes to use common DY processors and new EWK/EWK2μ2j entries.
config/Run3_2023BPix/global.yaml Adds nano_version: v12, met_type, and explicit NanoAOD versions for 2023BPix.
config/Run3_2023BPix/datasets.yaml Defines 2023BPix DY, EWK, TTLNu-EWK, and Higgs samples with generator and NanoAOD/MiniAOD metadata.
config/Run3_2023/processes.yaml Mirrors 2023BPix process reorganization for the non-BPix 2023 era.
config/Run3_2023/global.yaml Adds nano_version: v12, met_type, and NanoAOD versions for 2023.
config/Run3_2023/datasets.yaml Adds 2023 DY(105–160), EWK, TTLNu-EWK, and detailed Higgs sample definitions.
config/Run3_2022EE/processes.yaml Reorganizes 2022EE DY, EWK, signal, and data processes similarly to 2023/2023BPix.
config/Run3_2022EE/global.yaml Adds nano_version: v12, met_type, and NanoAOD versions for 2022EE.
config/Run3_2022EE/datasets.yaml Registers 2022EE DY(105–160), EWK, TTLNu-EWK, and Higgs datasets.
config/Run3_2022/processes.yaml Reorganizes 2022 processes: DY via all-flavor processors, EWK2μ2j, Higgs signals, and data groupings.
config/Run3_2022/global.yaml Adds nano_version: v12, met_type, and NanoAOD versions for 2022.
config/Run3_2022/datasets.yaml Adds 2022 DY(105–160), EWK, Higgs samples, and private LLJJ productions with FS paths.
FLAF Updates the FLAF submodule to a newer commit that likely supports the new configs and processing logic.
AnaProd/baseline.py Introduces LowerMassCut to define m(μμ) and apply an m > 45 GeV filter.
AnaProd/anaTupleDef.py Adjusts Jet NanoAOD V12 variable list, switches isV12 detection to nano_version, and applies LowerMassCut in the analysis chain.
Comments suppressed due to low confidence (2)

config/Run3_2023BPix/datasets.yaml:1

  • As in the 2024 config, the generator string for all VBFH amc@nlo samples is misspelled as amcanlo; this should be corrected to amcatnlo so that generator-based logic and metadata remain consistent across eras.
#### DY samples ####

AnaProd/anaTupleDef.py:1

  • Using global_params["nano_version"] assumes every configuration defines this key; to avoid KeyError when running on configs that have not yet been updated (e.g. older eras), it would be safer to access this with a default (or handle its absence) and/or document nano_version as a required field in all global.yaml files.
import AnaProd.baseline as AnaBaseline

@kandrosov kandrosov merged commit cdb5e2a into cms-flaf:main Feb 4, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants